Writer Identification through Information Retrieval: The Allograph Weight Vector

نویسندگان

  • Ralph Niels
  • Franc Grootjen
چکیده

We show a number of promising results in writer identification, by recasting the traditional information retrieval (IR) problem of finding documents based on the frequency of occurrence of their terms. In IR, the tf-idf is a well-known statistical measure that weighs the importance of certain terms occurring in a database of documents. Here, writers are searched on the basis of the frequency of occurrence of particular character shapes: the allographs. The results show a high retrieval score. Moreover, by using the af-iwf (allograph frequency inverse writer frequency) measure, qualitative and quantitative analyses can be made that elaborate on the particular allograph shapes that lead to a successful writer identification. In this paper, we sketch the application of these techniques in forensic science.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Handwriting Analysis based on Segmentation Method for Prediction of Human Personality using Support Vector Machine

Handwriting analysis is a method to predict personality of an author and to better understand the writer. Allograph and allograph combination analysis is a scientific method of writer identification and evaluating the behavior. To make this computerized we considered six main different types of features: (i) size of letters, (ii) slant of letters and words, (iii) baseline, (iv) pen pressure, (v...

متن کامل

Allograph Based Writer Adaptation for Handwritten Character Recognition

Writer adaptation is the process of converting a generic (writer-independent) handwriting recognizer into a personalized (writer-dependent) recognizer with improved accuracy for a particular user. While training the generic recognizer uses large amounts of data from several writers, the adaptation process uses only a few samples from a single user. In this paper we present a) an automatic appro...

متن کامل

Writer Identification and Retrieval Using a Convolutional Neural Network

In this paper a novel method for writer identification and retrieval is presented. Writer identification is the process of finding the author of a specific document by comparing it to documents in a database where writers are known, whereas retrieval is the task of finding similar handwritings or all documents of a specific writer. The method presented is using Convolutional Neural Networks (CN...

متن کامل

Information Retrieval Based Writer Identification

In this paper, we apply an Information Retrieval model for the writer identification task. A set of local features is defined by clustering the graphemes produced by a segmentation procedure. Then a textual based Information Retrieval model is applied. After a first indexation step, this model no longuer requires image access to the database for responding to a specific query, thus making the p...

متن کامل

Handwritten Document Analysis for Automatic Writer Recognition

In this paper, we show that both the writer identification and the writer verification tasks can be carried out using local features such as graphemes extracted from the segmentation of cursive handwriting. We thus enlarge the scope of the possible use of these two tasks which have been, up to now, mainly evaluated on script handwritings. A textual based Information Retrieval model is used for ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008